Automatic Creation of a Conceptual Base for Portuguese using Clustering Techniques

Authors

  • Hugo Gonçalo Oliveira
  • Paulo Gomes
Abstract

Following [2]...
1. Split the original network into sub-networks;
2. Calculate the frequency-weighted adjacency matrix F of each sub-network;
3. Fij = Fij + Fij ∗ δ, −0.5 < δ < 0.5;
4. Run MCL [3], with γ = 1.6, over F 30 times;
5. Use the (hard) clustering from each run to create P, a matrix with the probabilities of each pair of words in F belonging to the same cluster;
6. Remove: (a) big clusters, B, if there is a group of clusters C = {C1, C2, ..., Cn} such that B = C1 ∪ C2 ∪ ... ∪ Cn; (b) clusters completely included in other clusters.
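
Steps 3 to 6 can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the MCL routine is a bare-bones textbook version (expansion, inflation, column normalisation), the value 1.6 is assumed to be MCL's inflation parameter, the noise is assumed to be drawn independently and uniformly for each matrix entry, and all function names (`add_noise`, `mcl_labels`, `co_membership`, `prune`) are hypothetical. The abstract does not say how overlapping clusters are derived from P, so `prune` simply takes a list of clusters as input.

```python
import numpy as np

def add_noise(F, rng, amplitude=0.5):
    """Step 3: F_ij <- F_ij + F_ij * d, with d in (-amplitude, amplitude).
    Assumption: d is drawn independently and uniformly per entry."""
    d = rng.uniform(-amplitude, amplitude, size=F.shape)
    return F + F * d

def mcl_labels(F, inflation=1.6, max_iter=100, tol=1e-6):
    """Bare-bones Markov Clustering; returns one cluster label per node."""
    M = F + np.eye(len(F))      # add self-loops (a common MCL convention)
    M = M / M.sum(axis=0)       # make columns sum to 1
    for _ in range(max_iter):
        prev = M
        M = M @ M               # expansion
        M = M ** inflation      # inflation (element-wise power)
        M = M / M.sum(axis=0)   # renormalise columns
        if np.abs(M - prev).max() < tol:
            break
    # Hard clustering: each node joins the attractor row holding most
    # of its column's mass.
    return M.argmax(axis=0)

def co_membership(F, runs=30, inflation=1.6, seed=0):
    """Steps 4-5: P[i, j] = fraction of runs where i and j share a cluster."""
    rng = np.random.default_rng(seed)
    n = len(F)
    P = np.zeros((n, n))
    for _ in range(runs):
        labels = mcl_labels(add_noise(F, rng), inflation)
        P += labels[:, None] == labels[None, :]
    return P / runs

def prune(clusters):
    """Step 6: drop (a) clusters equal to a union of smaller clusters,
    then (b) clusters strictly contained in a remaining cluster."""
    cs = set(map(frozenset, clusters))

    def is_union_of_smaller(b):
        parts = [c for c in cs if c < b]   # proper subsets of b
        return bool(parts) and frozenset().union(*parts) == b

    after_a = {b for b in cs if not is_union_of_smaller(b)}
    return [c for c in after_a if not any(c < d for d in after_a)]
```

For instance, `prune([{1, 2}, {3, 4}, {1, 2, 3, 4}, {5, 6, 7}, {5, 6}])` first drops `{1, 2, 3, 4}` under rule (a) and then `{5, 6}` under rule (b). Applying (a) before (b) is a design choice of this sketch: applied the other way round, `{1, 2}` and `{3, 4}` would be discarded as subsets of the big cluster before it is removed.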

Related articles

Automatic Discovery of Fuzzy Synsets from Dictionary Definitions

In order to deal with ambiguity in natural language, it is common to organise words, according to their senses, in synsets, which are groups of synonymous words that can be seen as concepts. The manual creation of a broad-coverage synset base is a time-consuming task, so we take advantage of dictionary definitions for extracting synonymy pairs and clustering for identifying synsets. Since word s...

Automatic Detection and Localization of Surface Cracks in Continuously Cast Hot Steel Slabs Using Digital Image Analysis Techniques

Quality inspection is an indispensable part of modern industrial manufacturing. Steel as a major industry requires constant surveillance and supervision through its various stages of production. Continuous casting is a critical step in the steel manufacturing process in which molten steel is solidified into a semi-finished product called slab. Once the slab is released from the casting unit, th...

Automatic Segmentation of the Gross Tumor Volume in Prostate Carcinoma Using Fuzzy Clustering in Gallium-68 PSMA PET/CT Scan

Introduction: Modern radiotherapy (RT) techniques allow a highly precise deposition of the radiation dose in the tumor, so highly conformal tumor doses can be reached while sparing critical organs at risk. Materials and Methods: This study was conducted in three phases. In the first phase, fourteen patients with primary or recurrent prostate cancer receive Gallium-...

Onto.PT: Automatic Construction of a Lexical Ontology for Portuguese

This ongoing research presents an alternative to the manual creation of lexical resources and proposes an approach towards the automatic construction of a lexical ontology for Portuguese. Textual sources are exploited in order to obtain a lexical network based on terms and, after clustering and mapping, a wordnet-like lexical ontology is created. At the end of the paper, current results are shown.

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is an important technique for separating unlabeled data into groups. In this paper, an attempt has been made to improve and optimize the application of heuristic clustering methods such as the Genetic algorithm, PSO, Artificial Bee Colony, Harmony Search and Differential Evolution on the unlabeled data of an Iranian bank...

Publication date: 2010